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The background for the general mathematical link between util¬ 
ity and information theory investigated in this paper is a simple fi¬ 
nancial market model with two kinds of small traders: less informed 
traders and insiders, whose extra information is represented by an 
enlargement of the other agents’ filtration. The expected logarithmic 
utility increment, that is, the difference of the insider’s and the less 
informed trader’s expected logarithmic utility is described in terms 
of the information drift, that is, the drift one has to eliminate in or¬ 
der to perceive the price dynamics as a martingale from the insider’s 
perspective. On the one hand, we describe the information drift in a 
very general setting by natural quantities expressing the probabilistic 
better informed view of the world. This, on the other hand, allows us 
to identify the additional utility by entropy related quantities known 
from information theory. In particular, in a complete market in which 
the insider has some fixed additional information during the entire 
trading interval, its utility increment can be represented by the Shan¬ 
non information of his extra knowledge. For general markets, and in 
some particular examples, we provide estimates of maximal utility by 
information inequalities. 

0. Introduction. A simple mathematical model of two agents on a finan¬ 
cial market taking their portfolio decisions on the basis of different informa¬ 
tion horizons has attracted much attention in recent years. Both agents are 
small, and unable to influence the price dynamics of the risky assets con¬ 
stituting the market. One agent just acts on the basis of the evolution of 
the market, the other one, the insider, possesses some additional knowledge 
at every instant of the continuous trading interval. This basic fact is mod¬ 
eled by associating two different filtrations with each agent, from which they 
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make their portfolio decisions: the less informed agent, at time t, just has 
the cr-field J-t, corresponding to the natural evolution of the market up to 
this time, at his disposal for deciding about future investments, while the 
insider is able to make better decisions, taking his knowledge from a big¬ 
ger cr-field Qt D Tt- We give a short selection from among the many papers 
dealing with this model, just indicating the most important mathematical 
techniques used for its investigation. Methods are focused on martingale 
and stochastic control theory, and techniques of enlargement of filtrations 
(see [21]), starting with the conceptual paper by Duffie and Huang [10], 
mostly in the initial enlargement setting, that is, the insider gets some fixed 
extra information at the beginning of the trading interval. The model is 
successively studied on stochastic bases with increasing complexity: that is, 
Karatzas and Pikovsky [23] on Wiener space, Grorud and Pontier [14] allow 
Poissonian noise and Biagini and Oksendal [6] employ anticipative calculus 
techniques. In the same setting, Amendinger, Becherer and Schweizer [1] 
calculate the value of insider information from the perspective of specific 
utilities. Baudoin [5] introduces the concept of weak additional information 
consisting in the knowledge of the law of some random element. Campi [7] 
considers hedging techniques for insiders in the incomplete market setting. It 
is clear that the expected utility the insider is able to gain from final wealth 
in this simple model will be bigger than the uninformed traders’ utility, for 
every utility function. And, in fact, many of the quoted papers deal with the 
calculation of a better informed agent’s additional utility. 

In [3] , in the setting of initial enlargements and logarithmic utility, a cru¬ 
cial and natural link between the additional expected logarithmic utility and 
information theoretic concepts was made. The insider’s logarithmic utility 
advantage is identified with the Shannon entropy of the additional infor¬ 
mation. In the same setting, Gasbarra and Valkeila [15] extended this link 
by interpreting the logarithmic utility increment by the Kullback-Leibler 
information of the insider’s additional knowledge from the perspective of 
Bayesian modeling. In the environment of this utility-information paradigm, 
the papers [2, 8, 17, 18, 19, 20] describe additional utility and treat arbitrage 
questions and their interpretation in information theoretic terms in increas¬ 
ingly complex models of the same base structure, including some simple 
examples of progressive enlargements. It is clear that utility concepts differ¬ 
ent from the logarithmic one correspond on the information theoretic side 
to the generalized entropy concepts of /-divergences. 

In this paper we shall continue the investigation of mathematical ques¬ 
tions related to the link between utility and information theory in the most 
general setting of enlargements of filtrations: besides assuming eventually 
that the base space be standard, to ensure the existence of regular condi¬ 
tional probabilities, we shall let the filtration of the better informed agent 
just contain the one of the natural evolution of knowledge. To concentrate 
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on one kind of entropy in this general setting, we shall consider logarithmic 
utility throughout. In this framework, Ankirchner and Imkeller [2] calculate 
the maximal expected utility of an agent from the intrinsic point of view 
of his (general) filtration, and relate the finiteness of expected utility via 
the (NFLVR) condition to the characterization of semimartingales by the 
theorem of Dellacherie-Meyer-Mokobodski. The compensator in the Doob- 
Meyer decomposition of underlying asset price processes with respect to the 
agent’s filtration is determined by the information drift process. In this paper 
we shall give a general analysis of the nature of this process, and relate it to 
measuring the difference of the information residing in the two filtrations, 
independently of the particular price dynamics. The basic observation we 
start with in Section 2 identifies the information drift process with Radon— 
Nikodym densities of the stochastic kernel in an integral representation of 
the conditional probability process and the conditional probability process 
itself. This observation allows for an identification of the additional utility 
by the information difference of the two filtrations in terms of Shannon en¬ 
tropy notions in Section 5, again independent of particular price dynamics 
of the financial market. 

The paper is organized as follows. In the preparatory Section 1 we re¬ 
call the main results about the connection between finite utility filtrations, 
properties of the price dynamics from the perspective of different agents 
and properties of the information drift from [2]. In Section 2 (Theorems 2.6 
and 2.10) properties of the conditional probability processes with respect to 
the agents’ filtrations and the information drift process are investigated in 
depth, and lead to the identification of the information drift by subjective 
conditional probability quantities. The description of the additional utility 
in terms of entropy notions is more easily obtained, if the additional infor¬ 
mation in the bigger filtration comes in discrete bits along a sequence of 
partitions of the trading interval, leading to stepwise “initial enlargements” 
which ultimately converge to the big filtration as the mesh of the parti¬ 
tions shrinks to 0. This is done in Section 5 (Theorem 5.8), after being 
prepared in Sections 3 and 4 by a general investigation of the convergence 
properties of information drifts going along with the convergence of such 
discretized enlargements to the big filtration. In the final Section 6 general 
facts known from Shannon information theory (see [16]) are applied to esti¬ 
mate the expected maximal logarithmic utility of a better informed agent via 
the identification theorem of Section 5, in several particular cases. Entropy 
maximizing properties of Gaussian random variables play an important role. 

1. Preliminaries. In this preparatory section we define the financial mar¬ 
ket model and recall some basic facts about expected utility maximization. 
Our favorite utility function will be the logarithmic one, for which we will 
then compare the maximal expected utilities of agents on the market who 


4 


S. ANKIRCHNER, S. DEREICH AND P. IMKELLER 


act on the background of asymmetric information. Recalling a result from 
[2], we will describe the utility increment of a better informed agent by the 
respective information drift of the agents’ filtrations. 

Let (f2, JF, P) be a probability space with a filtration (, Pt)o<t<T, where T > 

0 is a fixed time horizon. We consider a financial market with one nonrisky 
asset of interest rate normalized to 0, and one risky asset with price St at 
time t £ [0,T]. We assume that S' is a continuous (.?})-semimartingale with 
values in R and write A for the set of all S-integrable and (J~t )-predictable 
processes such that 9q = 0. If 9 £ A, then we denote by (9 ■ S ) the usual 
stochastic integral process. For all x > 0, we interpret 

x + (9 ■ S)t, 0 <t<T, 

as the wealth process of a trader possessing an initial wealth x and choos¬ 
ing the investment strategy 9 on the basis of his knowledge horizon corre¬ 
sponding to the filtration (ft)- Throughout this paper we will suppose the 
preferences of the agents to be described by the logarithmic utility function. 
Furthermore, we suppose that the traders’ total wealth has always to be 
strictly positive, that is, for all t £ [0, T], 

(1) x + (9 ■ S)t > 0 a.s. 

Strategies 9 satisfying equation (1) will be called x-superadmissible. The 
agents want to maximize their expected logarithmic utility from their wealth 
at time T. So we are interested in the exact value of 

u(x) = sup{£ , log(x + ( 9 ■ S)t ): 9 £ A x-superadmissible}. 

Sometimes we will write ujf(x) in order to stress the underlying filtration. 
The expected logarithmic utility of the agent can be calculated easily, if one 
has a semimartingale decomposition of the form 

(2) S t = M t + f\sd(M,M) s , 

Jo 

where rj is a predictable process. Such a decomposition is given for a large 
class of semimartingales. For example, if S satisfies the property (NFLVR), 
then it may be decomposed as in (2) (see [12]). As is shown in a forthcoming 
Ph.D. thesis [4], finiteness of u(x) implies already such a decomposition to 
exist. Hence, a decomposition as in (2) may be given even in cases where 
arbitrage exists. We state Theorem 2.9 of [2], 

Proposition 1.1. Suppose S can be decomposed into S = M + rj - ( M , M ). 
Then for any x > 0, the following equation holds: 

(3) u(x) = logx + \E [ rjl d(M, M) s . 

Jo 
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This proposition motivates the following definition. 

Definition 1.2. A filtration (Qt) is called a finite utility filtration for 
S, if S' is a (^j)-semimartingale with decomposition dS = dM + (■ d(M, M), 
where ( is (^-predictable and belongs to L 2 (M), that is, E Jq ( 2 d(M, M) < 
oo. We write 

F = {(Tit) D (Et)\(TLt) is a finite utility filtration for S}. 

We now compare two traders who take their portfolio decisions not on the 
basis of the same filtration, but on the basis of different information flows 
represented by the hltrations (Qt) and (Tit), respectively. Suppose that both 
filtrations (Q t ) and (Tit) are finite utility hltrations. We denote by 

(4) S = M + (-(M,M) 

the semimartingale decomposition with respect to (Qt) and by 

(5) S = N + /3 ■ (N, N) 
the decomposition with respect to (Tit). Obviously, 

(M,M) = (S,S) = (N,N) 

and, therefore, the utility difference is equal to 

u n (x)-ug(x) = \E [ T (fi 2 — C 2 ) d(M,M). 

Jo 

Furthermore, (4) and (5) imply 

(6) M = N — ((- fi) ■ (M, M) a.s. 

If Qt C Tit for all t > 0, equation (6) can be interpreted as the semimartingale 
decomposition of M with respect to (Tit). In this case one can show that 
the utility difference depends only on the process /u = £ — p. We therefore 
use the following notion. 

Definition 1.3. Let (Qt) be a finite utility filtration and S = M + £ • 
(M,M) the Doob-Meyer decomposition of S with respect to (Qt). Suppose 
that (Tit) is a filtration such that Qt C Tit for all t £ [0, T], The (Tft)-adapted 
measurable process /x satisfying 

M — ii t d(M, M) t is a (Tfj)-local martingale 
Jo 

is called information drift (see [19]) of (Ti t ) with respect to (Qt). 

The following proposition relates the information drift to the expected 
logarithmic utility increment. 
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Proposition 1.4. Let (Gt) and (Tit) be two finite utility filtrations such 
that Gt C Tit for all t £ [0, T\. If p is the information drift of (Tit) w.r.t. (Gt), 
then we have 

r T 

u-h(x) — ug(x) = \E / p 2 d(M,M). 

Jo 

Proof. See Theorem 2.13 in [2]. □ 

So far we only required the information drift to be measurable and adapted. 
Due to the continuity of S, we have the following. 


Proposition 1.5. The information drift, provided it exists, may he cho¬ 
sen to he predictable. 

Proof. Suppose p is a measurable and ((/^-adapted process such that 

M — f p t d(M, M) t 
Jo 

is a (<7t)-local martingale. We denote by P p the predictable projection of p 
with respect to (Gt)- We will show that M — P p- (M , M) remains a ((/j)-local 
martingale. 

Let r be a stopping time localizing M such that M T , the martingale M 
stopped at r, is bounded. To simplify notation, we assume M T = M. Let 
0 < s < t, A e G s and e > 0. Then 

E(l A (M t -M a+e ))=E(l A J* p r d(M,M) r 

rt 


= E 1 j\E 


S+E 

t 


Pr d(M, M) r \Gs 


= E 1 A E 


J s+e 


P p r d(M,M) r \Gs 


= E [1a 


/ P p r d(M, M), 

(see Theorem 57, Chapter VI in [11]). By dominated convergence, the left- 
hand side of this equation converges to E(t A (M t — M s )) as e | 0. The right- 
hand side converges by similar arguments. Hence, we obtain 

E(l A (M t -M s )) = E(l A J* p p r {M,M) r ' S ), 

which means that M - P p- (M , M) is a ((/^-martingale. □ 

We close this section by recalling some basic properties of information 
drifts. 
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Lemma 1.6. Suppose the filtration {Et) is a finite utility filtration with 
respect to which the Doob-Meyer decomposition of S is given by S = M + 
r] ■ (M,M). Let {Tit) be « filtration satisfying Et C Tit for all t £ [0, T] and 
suppose that {Tit) has an information drift p with respect to {Ft). Then the 
following properties hold true: 

(i) If p belongs to L 2 {M), then the maximal expected utility ut-c{x) is 
finite for all x > 0. 

(ii) The set of finite utility filtrations F is equal to the set of all filtrations 
containing {Et) and possessing an information drift A with respect to {Et) 
such that A £ L 2 {M). 

(iii) If {Tit) is a finite utility filtration, then p is orthogonal to ifipfiM), 
the subspace of {Et)-predictable processes in L 2 {M). 

(iv) If {Q t ) is a filtration such that Et C Qt C Tit for all t £ [0, T], then 
there is also an information drift k of {Qt) with respect to {Et). More pre¬ 
cisely, k is equal to the L 2 {M)-projection of p onto the subspace of the 
{Qt)-predictable processes. 


Proof. Properties (i) and (ii) are obvious. For property (iii), let S = 
N + (3 ■ (N, N) denote the Doob-Meyer decomposition of S relative to {Tit), 
and let 8 £ L 2 T {M). Since 9 is adapted to both {Et) and {Tit), the integrals 
{9 ■ M) and {9 ■ N) are square integrable martingales with expectation zero. 
Therefore, 


E [ T 9pd{M,M) = E 
Jo 


= E 
= 0 . 


[ T 6/3d(M,M) - [ T 9rjd{M,M) 
/ o Jo 


9dM — / 8dN 
Uo Jo 


Thus, p is orthogonal to Ljr(M). For property (iv), we refer again to [2]. 

□ 


2. General enlargements. Assume again that the price process S is a 
semimartingale of the form 

S = M + rj- (M,M), 

with respect to a finite utility filtration {J~t). Moreover, let {Qt) be a filtra¬ 
tion such that Et C Qt, and let a be the information drift of {Qt) relative 
to {Et). And, for simplicity of notation, suppose in this section that time 
horizon is infinite, that is, T = oo. We shall aim at describing the relative 
information drift a by basic quantities related to the conditional probabil¬ 
ities of the larger a -algebras Qt with respect to the smaller ones Et,t> 0. 






S. ANKIRCHNER, S. DEREICH AND P. IMKELLER 


Roughly, modulo some tedious technical details to be specified below, the 
relationship is as follows. Suppose for all t > 0 there is a regular conditional 
probability Pt(-, •) of P given P t , which can be decomposed into a martingale 
component orthogonal to M , plus a component possessing a stochastic inte¬ 
gral representation with respect to M with a kernel function kt(-, •)• Then we 
shall see that, provided a is square integrable with respect to d(M,M) <g> P, 
the kernel function at t will be a signed measure in its set variable. More¬ 
over, this measure is absolutely continuous with respect to the conditional 
probability, if restricted to Qt, and a coincides with their Radon-Nikodym 
density. 

We shall even be able to show that this relationship also makes sense 
in the reverse direction. Roughly, if absolute continuity of the stochastic 
integral kernel with respect to the conditional probabilities holds, and the 
Radon-Nikodym density is square integrable, the latter will turn out to 
provide an information drift a in a Doob-Meyer decomposition of S in the 
larger filtration. 

We shall finish the section with an illustration of this fundamental rela¬ 
tionship by discussing some simple examples of particularly enlarged filtra- 
tions. 

The discussion of the details of this fundamental relationship requires 
some care with the complexity of the underlying filtrations and state spaces. 
Of course, the need to work with conditional probabilities first of all confines 
us to spaces on which they exist. Let therefore (kl,P, P) be a standard Borel 
probability space (see [22]) with a filtration (Pt)t >o consisting of countably 
generated cr-algebras, and M a (^y°)-local martingale. We will also deal 
with the smallest right-continuous and completed filtration containing (Pf), 
which we denote by (Pt). We suppose that Pq is trivial and that every (Pt)- 
local martingale has a continuous modification. Since pf is a subfield of a 
standard Borel space, there exist regular conditional probabilities Pt relative 
to the (T-algebras P t °. Then for any set A G P, the process 

(t,u) P t (uj,A) 

is an (Pf )-martingale with a continuous modification (see, e.g., Theorem 4, 
Chapter VI in [11]). Note that the modification may not be adapted to 
(Pt), but only to (Pt). Furthermore, it is no problem to assume that the 
processes Pt(-,A ) are modified in a way such that Pt(uj, •) remains a measure 
on P for Pvf-almost all (u>,t), where Pm is a measure on Q x R + defined by 
P M (T) = E f 0 °° l r (w, t) d(M, M) t ,TeP®B + . 

It is known that each of these martingales may be uniquely written (see, 
e.g., [24], Chapter V) 

(7) P t (-,A) = P(A) + [* k s {-,A) dM s + Lt 

Jo 
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where k(-,A) is (.^-predictable and L A satisfies (L A ,M) = 0. 

Now let (Gt) be another filtration on (Gl,P, P) satisfying 

?? c g? 

for all 0 < t < T. We assume that each cr-field Gt is generated by a count¬ 
able number of sets, and denote by (Gt) the smallest right-continuous and 
completed filtration containing (Gt )- It is clear that each cr-field in the left- 
continuous filtration (Gt~) is a lso generated by a countable number of sets. 
We claim that the existence of an information drift of (Gt) relative to (Pt) 
for the process M depends on whether the following condition is satisfied or 
not. 


Condition 2.1. kt(ut,-)\go is a signed measure and satisfies 

h(u, -)|go_ ^ Pt(u, -)\ g o_ 

for Pm- a.a (uj,t). 

Remark 2.2. Unfortunately, we have to distinguish between the fil- 
trations (P^), (Gt) and their extensions (Pt), (Gt)- The reason is that the 
regular conditional probabilities considered exist only with respect to the 
smaller cr-fields. On the other hand, we use stochastic integration techniques 
which were developed only under the assumption that the underlying filtra- 
tions satisfy the usual conditions, and this necessitates working also with 
the larger cr-fields. 

Let us next state some essential properties of the Radon-Nikodym density 
process existing according to our condition. 


Lemma 2.3. Suppose Condition 2.1 is satisfied. Then there exists an 
(Pt <8> Gt)-predictable process 7 such that, for Pm- a.a. (u>,t), 


7t(cu,u/) 


dk t (io , •) 
dP t (io , •) 



(J). 


Remark 2.4. Note that 7^(0;,-) is ^--measurable. This is due to the 
fact that the predictable cr-algebra does not change by taking the left- 
continuous version of the underlying filtration. 


Proof of Lemma 2.3. Let ff = T for all n > 0 and i > 0. We denote 
by T the set of all tf. It is possible to choose a family of finite partitions 
('P i,n ) such that: 

• for all t e T, we have Gt- = cr(V t,n :i,n> 0 s.t. Tf = f), 




10 


S. ANKIRCHNER, S. DEREICH AND P. IMKELLER 


• if i < j, n < m and i2~ n = j2~ m , then V l,n C V 3,m . 

We define, for all n > 0, 

j >0 Ae£'’ n > ) 

Note that is (J^)-predictable and j is (^t)-predictable. 

Hence, the product of both functions, defined as a function on H 2 x M + , is 
predictable with respect to (fFt < 8 > Gt)- It follows that each 7 ” and, thus, 

7 = lirn inf 7 ”, 
n—> 00 

is (J 7 <g> £/t)-predictable. 

Now fix t > 0. We claim that kt(u>,-) = /. 7 t(w,u/)Pt(w,du/) and, hence, 
that 7 t (w, ■) is the density of k t (u,-) with respect to Pm- a.s. For 

all n > 0 , let j = j(n) be the integer satisfying t™ <t < t” +1 and denote by 
Q n the corresponding partition V 3,n . Observe that (Q") is an increasing 
sequence of partitions satisfying 

fr(Q n :n> 0 )=^°_ 

and, hence, 

7 t(u;,u/) = liminf 7 ^( 0 ;, a/) 


= liminf £ - d *‘ (a ’ , ' ) 


AeQ n 


Pt(v,A ) dP t (u,-) 


SL 


□ 


Lemma 2.5. If(t,co,Lo') *—> 0t{u,u') is {Pt®Gt)-predictable and bounded, 
then 

fjf 9 t (u;,u , )P t (u,dw , )d(M,M) t dP{u) = J j 6 t (u,u)d(M,M) t dP(u). 

Proof. Let 0 < r < s, He T r , B £G r and 

9 t ((j,u') = l] riS ](i)l a(w)1b(w 7 ). 


jjj 9 t {u,u , )P t (io,dco , )d{M 1 M) t dP(u) 

= j j\ A {uj)P t {u,B)d{M,M) t dP{u) 

= JJ° 1a(w)1s(w) d(M, M} t dP(w) 

= JJ d t (u,u)d(M,M) t dP(u;), 


Then 
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where the second equality holds due to results about optional projections 
(see Theorem 57, Chapter VI in [11]). By a monotone class argument, this 
can be extended to all bounded and (Ft <S> t?f)-predictable processes. □ 


Theorem 2.6. Suppose Condition 2.1 is satisfied and 7 is as in Lemma 2.3. 
Then 


a t {u) =7t(w,w) 

is the information drift of (Qt) relative to (Ft). 


Proof. Suppose r to be a stopping time such that M T is a martingale. 
For 0 < s < t and A e Q ®, we have to show 


E[l A (Mf-Mf)] = E 1 A J lu (co,io)d(M,M)l 

For notational simplicity, write M T = M and observe 
E[l A (M t - M a )] = E[P t (;A)(M t - M a )] 


= E 

= E 

= E 

= E 


(M t - M s ) f t k u (-,A)dM u 
Jo 


k u (-,A)d(M,M) l 


+ E[(M t - M s )Lf] 


J J^'y u (u},uj , )dP u (uj,du')d{M,M) l 


1 a (uj) / 7 u (uj,uj)d{M,M) l 


where we used Lemma 2.5 in the last equation. □ 


Corollary 2.7. (Qt.) is a finite utility filtration if and only if 

JJJ r $(u)iLo')Pt(tjj,du')d(M,M)tdP(uj) < 00 . 


Proof. This follows immediately from Lemma 2.5. □ 

We now look at the problem from the reverse direction. Starting with 
the assumption that (Qt) is a finite utility filtration, which amounts to 
E Jq a 2 d(M, M) < 00 , we show the validity of Condition 2.1. 

In the sequel, (Qt) denotes a finite utility filtration and a its predictable 
information drift, that is, 

M = M — [ a t d(M, M) t 

Jo 


(8) 
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is a (Gt)~ local martingale. To prove the main results (Theorems 2.10 and 
2.12), we need the following lemma. 

Lemma 2.8. Let 0 < s < t and V = {A±, ... , A n } be a finite partition of 
12 into G?-measurable sets. Then 


7 S 

E f s p {^)\;A k )t Ak d(M, M) u < 4 E(J* a 2 u d(M, M) x 


< oo. 


Proof. Let V = {A\, ... , A n } be a finite (^-partition. An application 
of Ito’s formula, in conjunction with (7) and (8), yields 

n 

J2\- 1a k lo g p s(-,A k ) - t Ak log P t (-,A k )] 


k =1 


= £ 

k =1 


r* l 


P u (-,A k ) 


t Ak dP u (-,A k ) 


+ ~ [ — r-r^'^A k d(P(-,A k ),P(-,A k )) l 


(9) 


= £ 

k =1 L 


2 Js Pu(;A k y Ak 


-/ ^E(-,A k )t Ak dM u - [ ^{ A A k )l Ak a u d{M,M) x 


Pu 

r* 1 


P u (-,A k ) 

rt / l. \ 2 




+ UXk) ^ A ^ lA k d(M,M) x 


1 


1 




2 A P„(-,A fc ) 2 

Note that P t (-, A k ) logP t (-,A k ) is a submartingale bounded from below for 
all k. Hence, the expectation of the left-hand side in the previous equation 
is at most 0. 

A priori it is not clear whether 

n rt b 

£ / -jE(;A k )t Ak dM u 

k=l Js u 

is integrable or not. Consider therefore, for all e > 0, stopping times defined 
by 

OO, u ; ^ A k , 


T k = i : 


inf{t > s : Pt(-,A k ) < e}, else, 
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and 


t = n A • ■ • A T e n . 


Observe that r £ —> oo as e j 0 and that the stopped process 


E 

fc=i • 


r tAre k v 

s T>, 


(■,A k )t Ak dM u 


has expectation zero, since 

rtAr £ n ^ 


rt/\T P _ __ \ 

Js k =1 “ / 

rtAr £ n / U \ 2 

/ E(ir) 

" /s fc=l' u ' 


= E 


£ Z 


A* 

£ z 


rt/\T e 




fc=i 


« rt 

U'=l/ S 


< oo. 

Similarly, one can show that the expectation of 

/■iAr® 1 

L P u (.,A k ) lAkdLuk 

vanishes. Consequently, we may deduce from (9) and the Kunita-Watanabe 
inequality 


^E 


i 


fc=i 


rt/\T e / r. 

P, 


{■,A k )t Ak d{M,M) t 


<^E 

fc=i 


^ r /-tAr® 




(■,A k )t Ak a u d(M,M) t 


S W 

tAr £ n / U \ 2 


<E\ 


rt/\T b '<■ / U \ 2 

J s E(tt) (;A k )t Ak d(M,M) z 


1/2 


x E 


rt/\r e 


aid(M,M) u \ 


\ 1/2 


which implies 


rtA-r® n 


£ 


E 

fc=i 


p, 


{;A k )t Ak d(M,M) u <4E( ol u d(M, M) 


r tAr e 
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Now the proof may be completed by a monotone convergence argument. □ 


Let T and {V l,n )i , n >o be a family of partitions as in the proof of Lemma 2.3. 
We define, for all n > 0, 


Z?(v,w') — y~] y l] t n !t n + j(t)lA(v') 

i >0 A&V i ’ n 


k t (v,A) 

Pt(co,AY 


Note that Z n is (Ft <8> £t)-predictable. We are now able to prove a converse 
statement to Theorem 2.6. Observe first the following: 


Lemma 2.9. For PM-almost all (u>,t) £ f2 x M + , the discrete process 
(Z t m (u;, -)) m >i is an L 2 (P t (u>, ■))-bounded martingale. 


Proof. Every statement in the sequel is meant to hold for Pm- a.a. 
(iu,i)£(lx M_|_. 

Let m> 0, l > 0 and j be the natural number such that C 

]t™,t™ + 1 ]. We start by proving that on we have 

= Z?(u,.). 

For this, let B £ 'P 7 ’ m and Al,... ,Ak £ p l ' m+1 such that x4i U • • • U A^ = B. 
Note that 


i^(-% B (.) Zr +V, •)] = £ Pt(av) 



k 

= ^2kt(u,Ai) 

i=l 

= kt(uj,B) 

= E P ^\t B (.)Z?(u>,-)) 


on Consequently, the process (Z™(uj, -))m>i is a martingale 

(with respect to a filtration depending on f). The martingale property im¬ 
plies that the sequence /(Z”) 2 (u;,u/)Pi(a;,du/) is increasing and, hence, by 
monotone convergence, 


supF JJ {Zt) 2 {u,uj l )P u (LO,doj l )d{M 1 M)t 

= E J sup J (. Z?) 2 (u,u')P u (u,du')d(M,M) t . 
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By Lemmas 2.8 and 2.5, we have 

sup eJJ {Z™) 2 {u,u')P u (u,duo')d(M,M) t 


sup E f(Z2) 2 (u,u))d(M,M) t 

n J 

: ^p^E/ n l+1 E 

i>0 h Ae / P i ' n 


kt(u,A) ' 2 
Pt(u,A ) 


d(M,M) t 


<4 E^j ald(M,M) u ^J < oo. 

This shows that ( Z n ) n >\ is an L 2 (P t (u, -))-bounded martingale. □ 


We now will show that k can be chosen to be a signed measure. For this 
we identify Pfiuj,-) with another measure on a countable generator of £/°_. 
We then apply the result that two Banach space valued measures are equal, 
if they coincide on a generator stable for finite intersections. 


Theorem 2.10. The kernel k may be chosen such that 

Qt- D k t (u,A) G R 

is a signed measure which is absolutely continuous with respect to Pt(u>, -)\go , 
for Pm- a.a. (u,t) G x [0,oo). This means that Condition 2.1 is satisfied. 

Proof. Lemma 2.9 implies that (Z™(u>, -))m>i is an L 2 (Pt(u, -))-bounded 
martingale and, hence, for a.a. fixed (Z 4 m (w,-))m>i possesses a limit 

Z. It can be chosen to be (Pt <8> £/t)-predictable. Take, for example, 

Z t = liminf(Z” V 0) + limsup(Z t n A 0). 

n n 

Now define a signed measure by 

k t (u,A)= j ±A{w')Zt{w,u')dP t {u,du'). 

Observe that k t (co,-) is absolutely continuous with respect to Pt(u,-) and 
that we have, for all A G V^ m with j2~ m < t, 

k t (u,A) = k t (ui,A) 

for Pm- a.a. (u>,t) GOx R + . One may also interpret 3 A i—> kt(u,A), as 

an L 2 (M)-v alued measure. By applying the stochastic integral operator, we 
obtain an L 2 (fl)-valued measure: f ( j k s (uj. A) dM s . Moreover, 

(10) P t (u,A) = P(A)+ [ t k s {u,A)dM s + Lf(u ;) 

Jo 
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for all A € Since the LHS and both expressions on the RHS 

are measures coinciding on a system which is stable for intersections, (10) 
holds for all A €E Gt-- Hence, by choosing k t (-,A) = k t (-,A) for all A £ Gt-i 
the proof is complete. □ 


Remark 2.11. Since k is determined up to Pm-iiu11 sets, we may assume 
that kt(uj,-) is absolutely continuous relative to Pt(to , •) everywhere. 


We close this section with some examples showing how (well-known) in¬ 
formation drifts can be derived explicitly, based on the formalism of The¬ 
orem 2.6. To this end, it is not always necessary to determine the signed 
measures kt(u, •) on the whole cr-algebras Q®, but only on some sub-c-fields. 
This is the case, for example, if 

Gt=F?\TH° t , 0 <t<T, 

where (Ti®) is some countably generated filtration on (Tl,F). 

Now suppose that kt(uj,-) is a signed measure on (7Y°_) satisfying 


for Hjvf-a.a (co,t)- Then we can show with the arguments of the proof of 
Lemma 2.3 that there is an (Ft <8> Tft)-predictable process /3 such that Pm~ 
a.e. 


Pt{u,u') 


dk t (a;, •) 
dP t {u, •) 



V)- 


The information drift of (Gt) relative to {J~t) is already determined by the 
trace of (/3t). For the corresponding analogue of Theorem 2.6, we shall give 
a more explicit statement. 


Theorem 2.12. The process 

a t {io) ={3 t (v,u) 

is the information drift of (Gt) relative to (Ft)- 

Proof. Suppose T to be a stopping time such that M T is a martingale. 
For 0 < s < t, A € TL 0 S and B £ F®, we have to show 


E[t A t B (M? -Mj)] = E 


t A t B / P u (u,u)d(M,M) 


For simplicity, assume M T = M , and observe, like in the proof of Theorem 

2 . 6 , 

E[l A t B (M t - M s )] = E[l B P t {;A)(M t - M 8 )] 

rt 


= E 


t A (u)t B (u) J f3u(u,v)d(M,M) t 


□ 
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Example 2.13. Let (Wt) be the standard Wiener process and (F®) 
the filtration generated by (Wt). Moreover, let (Yt) be a Gaussian process 
independent of T\ such that, for each pair s,t with 0 < s < t, the difference 
Yt — Y s is independent of Y t . We denote by wt the variance of Y t . 

We enlarge our filtration by 

n° t = a(W i +Y s :0<s<t) = a(W\ + Y t ) V a(Y t — Y s : 0 < s < t), 

and put G® = Ft V Tit, 0 < t < 1. Now observe that, for all C € o(Yt — Y s : 0 < 
s <t) and Borel sets B 6 B(M), we have 


Pt(;{Wi + Y t €B}nC) = P(C) J t B (x + W! 


W t + Y t )dP 


x=Wt 


= P(C) / i B (y + x)4> 1 - t +w t (y) dy 


x=Wt 


= P(C) [ (f>i-t+wt(y-W t )dy, 0 < t < 1, 
J B 


where 


fiviy ) — / 0 \\/2 e 

(2nv) L P 

Now observe that f(x,t) = P(C) f B 4>i-t+w t 
and satisfies 


y 2 /(2v) 

(y — x) dy is differentiable in x 


4-f(x,t) = P(C) [ - — (j>i- t+wt (y - x) dy 

ox J B 1 -t + w t 

for all 0 < t < 1 and i£M. By Ito’s formula, 


Pt(; {W! +Y t eB}nC) = /(0,0) + £ ^f(W s , s) dW s + A t , 0 < t < 1, 

where A is a process of bounded variation. Note that A is also a martingale 
and, thus, A = 0. Hence, 


kt(;{Wi + Y t eB}nC) 

= P(C)[ y ~^ ^ t+wt (y - Wt) dy 

J B l-t + wt 

= P(C) f t B (y + x) V + X X (j>!- t+wt (y) dy 
J 1—t + Wt 


f ^(oo + y,(aQ-iy,M ^ 

i{Wi+Y t eB}r\C l-t + wt 


As a consequence, 

Pt(u,v') = 


kt(io, duj') 


Pt(w, du') 


H° t 


W^ + YtW-Wtju) 

1—t + Wt 
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and by Theorem 2.12, 


W t - 


Jo 1 -s + w s 


0<t< 1 , 


is a martingale relative to (Gt)- 

Similar examples can be found in [8] where the information drifts are 
derived in a completely different way though. 


Example 2.14. Let (Wt) be the standard Wiener process and ( Ft ) the 
Wiener filtration. We use the abbreviation W t * = sup 0<s<t W s and consider 
the filtration enlarged by the random variable G = c > 0. Again, 

we want to apply Theorem 2.12 in order to obtain the information drift of 
Gt = Ft V er(G'). To this end, let Z t = sup i<r<1 (W r — Wt) and denote by p t 
the density of Z tl 0 < t < 1. Now, 


Pt(;G = 1 ) = P(W* VW t + Z t < c\F t ) 
= I l[o,c](y v x + Z t )dP 

rc—x 

= 1 [0,c](y) J o pt{z)dz 


x=W t ,y=W* 


x=W t ,y=W* 

for all 0 < t < 1. Note that F(x,y,t ) = l[o , c ](u) fo X Pt(z) dz is differentiable 
in x for all 0 < t < 1 and x £ M, and by Ito’s formula, 

Pt(-,G = 1) = F(0,0,0) + f -?-F(W s ,WZ,s)dW s +A t , 0<t<l, 

Jo ox 

where A is a process of bounded variation. Hence, 

k t (;G = l) = -^F(Wt,W t *,t), 0 < f < 1. 

Similarly, we have 

Pt(;G = 0) = H(W t ,W?,t), 0 < t < 1, 

and 

d 


where 


kt{-,G = 0) = —H(W t ,W t *,t), 0<t<l, 


r oo 

H(x,y,t) = t iCt0o) (y) + l [0}C] (y) / p t {z)dz. 

J c—x 
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As a consequence, 


Pt(v,u}') 


,du r 


, duj') 


cr(G) 




0<t< 1. 


3. Monotone convergence of information drifts. In the preceding section 
we established a general relationship between the information drift and the 
regular conditional probabilities of filtrations. In this framework the knowl¬ 
edge of the better-informed agent is described by a general enlarged filtra¬ 
tion (Qt) of (Tt). We shall now consider the question whether this situation 
may be well approximated by “stepwise initial” enlargements, for which we 
take Tt V Qt,- for t e [U,ti- (- 1 ), if the family (U)o<i<n is a partition of R+. 
One particularly important question in this context concerns the behavior of 
the information drifts along such a sequence of discretized enlargements. Of 
course, we expect some convergence of the drifts. We shall establish this fact 
rigorously in the following section. In the present section, we shall prepare 
the treatment of this problem by solving a somewhat more general prob¬ 
lem. Let (Gt)ne N be an increasing sequence of finite utility filtrations and 
sup n ug»(a:) be finite. We will show that the smallest filtration containing 
every (£/”) is then also a finite utility filtration. 

Since we will not deal with regular conditional probabilities in this section, 
it is not necessary to require our probability space (fi,E) to be standard. 

We use the terminology of Revuz and Yor [24]: H 2 (Ft) denotes the set 
of L 2 -bounded continuous (.Fj)-martingales, that is, the space of continuous 
(J~t, P)-martingales M such that 

supE(Ml) < 00 . 
t> 0 

We need the following characterization of H 2 (Et). 


Lemma 3.1 (Proposition 1.23 in [24]). A continuous ( J-t)-local martin¬ 
gale belongs to H 2 (Et) if and only if the following two conditions hold: 

(i) E(M$) < 00 , 

(ii) E((M, M)oo) < 00 . 


The properties (i) and (ii) are independent of the filtration considered. 
This is due to the fact that the quadratic variation of M does not change un¬ 
der a new filtration (Qt) for which M is still a semimartingale. We therefore 
have the following: 
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Lemma 3.2. Suppose M £ H 2 (Et)- Let (Qt) be a filtration such that M 
is still a ( Qt)-semimartingale. If 

M = M + A 

is a Doob-Meyer decomposition with respect to (Qt) with A$ = 0, then M 
belongs to H 2 (Q t ). 

Proof. Notice that Mo = Mo and (M, M) = (M, M). The claim follows 
now by applying Lemma 3.1 twice. □ 


Now let M be a continuous (Et)- local martingale and (Qt) n >i an increas¬ 
ing sequence of filtrations, that is, for all t > 0, we have 

TtCQt C---cQt C Qf +1 C • • •. 

We assume that, for all n > 1, the process M is a (t/”)-semimartingale with 
Doob-Meyer decomposition of the form 

M = M n + [ n™d(M, M) s , 

Jo 

where /i n is )-predictable. We then have the following asymptotic prop¬ 
erty. 

Lemma 3.3. If the processes (^ n ) nS N converge to some /r in L 2 (M), 
then 

M- [ fi s d(M, M) s 
Jo 

is a local martingale with respect to Qt = V n >iQt A > 0. 


Proof. Suppose the stopping time r reduces M such that M T is a 
bounded martingale. Note that Lemma 3.2 implies that the stopped pro¬ 
cesses ( M n ) T are (^)-martingales. 

For simplicity, we assume M r = M. Choose a constant C > 0 such that 

roc 

\M\ < C and E / (plf) 2 d(M , M) s < C 2 foralln>l. 

Jo 


Now let £ > 0, 0 < s < t and A £ Q s . It suffices to show 


E[l A (M t — M s )] — E 
We start by choosing uq such that 


/ hs d(M, M)< 


<£. 


IK - ^IIl 2 (m) < 


4^((m,m)) 00 
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for all n > no- 

Note that U n >n 0 is an algebra generating the cr-algebra Vn>i @s = 
Q s = Vn> no 5"- Hence, we can find a sequence (Aj) ie N of sets in (J n>n 0 ^s 
such that P(A A Ai) —> 0. A subsequence of (IaJigN converges to 1 a almost 
surely and, therefore, we may choose n > uq and A E Q™ satisfying P(A A 
A) < (^) 2 and 

/ft \ 1/2 £ 

[EJs(t A -t A ) 2 (M,M)) <—. 

Hence, we have 


\E[l A (M t - M s )} - E[l A (M t - M s )]| < ^[(1^ - 1 A )(M t - M s )]\ 

< P(A A A) 1/2 (E(M t - M s f) 1/2 


< 


By applying the Kunita-Watanabe inequality, we get, for n > no, 


E 


1 A / Hu d(M, M) l 

J S 

rt 


-E 


1 a / ttd(M,M) t 


< 


E 


/ {Hu - t£)d{M,M) t 

J S 

1/2 


+ 


E 


< (eJ l A d(M,M)^ (e j™(iis-H n s ?d{M,M) 

t \ 1/2 / fOG 

(1a - 1a) 2 d(M, M)\ yE j (^) 2 d(M, M) 


+ E 


(1a -1a) J d(M, M) r 
1/2 

1/2 


< {E{M,M)f/ 2 \\H-h u Wl\m) + j 


4 


and thus, 


E[l A (M t - M s )} - E 


1a/ Hu d(M, M) x 


< \E[l A {M t - M s )} - E[l A (M t - M s )}\ 


+ 


+ 


E[l A {M t -M s )}-E 


t A Hud(M,M) r 


t A Hud(M,M) r 


-E 


1a / Hud{M,M) l 


£ £ 

S 2 +H r £ ' 
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□ 


We are now in a position to prove the main result of the section. 

Theorem 3.4. If sup n>1 ||/-i n || 2 2 ( M ) < oo, then (/r n ) converges in L 2 (M) 
to a process fi. Moreover, 

M- f /ad{M,M) 


is a local martingale with respect to Qt = \J n >\Qtit — 0- 

Proof. Set c = sup n>1 Let m > n > 1, and note that //" — 

p n is the information drift of (Qt 1 ) relative to (Qf). Therefore, property (iii) 
in Lemma 1.6 implies 



as n—> oo. Therefore, {^ n } n >i is a Cauchy sequence in L 2 (M). By com¬ 
pleteness of L 2 (M), there exists a unique (^f)-predictable process /i° E 
L 2 (M) such that lim n ^ 00 /i n = in L 2 (M). By Lemma 3.3, the process 
M — f fi° d(M, M) is a (^)-local martingale. □ 

4. Continuous and initial enlargements. In this section we relate general 
enlargements of filtrations to “initial enlargements” along discrete partitions 
of [0,T], for finite horizon T. The knowledge of the insider is modeled by an 
arbitrary filtration (f?t)t£[o,T]> satisfying Q t D Tt, 0 <t<T. For s E [0, T], we 
set 



t<s , 
t> s. 


Again, the analysis of this section does not require our probability space 
to be standard. 

Remark 4.1. In the case where the cr-field G s -,s E [0, T], is generated 
by a countable number of events, say, (A n ) n€ N, the enlarged filtration Qf 
can be viewed as initial enlargement at time s in the classical sense. In that 
case Q s _ = (r(tA n '■ n E N) and one has, for t E [0, T], 


F f V^-=T t Va(l^:nEN). 


The set {0,1} N can be endowed with a metric so that it becomes a Polish 
space with corresponding Borel—cr-field £>({0,1})® N . Hence, the filtration 
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(Qf) can be seen as initial enlargement at time s induced by the random 
variable G:Q —* {0, l} N ,u; i—> (lA„(^))neN- In particular, the standard the¬ 
ory of initial enlargements is applicable. See [21]. 

In the following, we assume that (£?*) is for arbitrary s G [0,T] a finite 
utility filtration. For 0 < s < t < T, we denote 

T 

7To([0 ,s)x(t,T]) = F(s,t) = ±E j (^r ) 2 d(M, M) r , 

where fi s is a (^|)-information drift. So far ttq is dehned only on the set 
J = {[ o, s) x (t,T]: s <t}. As the next lemma shows, 7r 0 can be extended to 
a measure on the Borel sets of D = {(s, f)£l 2 :0<s<KI}. 

Lemma 4.2. There exists a unique measure ir on the Borel sets B(D ) of 
D satisfying -tt\j = ttq. 

Proof. Uniqueness is an immediate consequence of the measure ex¬ 
tension theorem. In order to show the existence of an extension, it satis¬ 
fies to verify the following property which essentially amounts to count¬ 
able additivity on a generating semiring (see [13], Chapter II, Satz 3.8): 
For any (s,t) G D and any sequence (s n ,t n ) ne pj in D with s n < s, t n >t 
and lim ra _ >00 (s n , t n ) = ( s,t ), we have lim n ^ 00 F(s n ,t n ) =F(s,t). Moreover, 
F(s n ,t n ) < F(s,t) < oo. 

Let s n , t n , s and t as above. Without loss of generality, we assume that 
(s n ) is monotonically increasing. For it G [t,T ], we consider the filtrations 
(^r")re[u,T]i ra G N, over the time interval [u,T], The filtrations are mono¬ 
tonically increasing with VneN @r n = r £ [ u , T], Since (Hr")r£[u,T] are 
(&r")-mformation drifts, it follows (by Lemma 1.6) that 

E [ T {fi s - //")//" d(M, M) = 0. 

J U 

In particular, 

E f T (n Sn ) 2 d{M,M) <E f T (n s ) 2 d(M,M) < oo. 

Ju Jt 

By Theorem 3.4, the processes (t L r ri )r£[u,T] converge to the information drift 
(lJr)re[u,T] i n L 2 (M ; [u,T]). Therefore, for any u G (t, T], 

lim inf E ^(//«) 2 d(M, M) > E / T (//) 2 d(M, M). 

n ~*°° Jt n Ju 

Due to the continuity of M, the right-hand side of the previous equation 
tends to E / t T (/P) 2 d(M, M) as u J. t. Consequently, we obtain lim^^oo F(s n , t n ) 
F(s,t). □ 
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The measure 7r describes the utility increase by additional information. 
As will be shown below, n(D) is finite if and only if (Qt) is a finite utility 
filtration. 

We now approximate the general filtration (Qt) by filtrations that can be 
seen as successive initial enlargements. Let A : 0 = so < • • • < s n = T, n £ N, 
be a partition of the interval [0, T] . We let, for r E [sj, Sj+i), * = 0,..., n — 1, 

S r A = Q St . V Tr. 

Proposition 4.3. For i = 0,... ,n — 1, let p Si be a (Qp)-information 
drift and set /i A := pp for r E [sj, Sj +1 ). Then p A is a Q A -information drift. 
Moreover, 

1 f T (fj. A ) 2 d(M, M) r = tt(D a ), 
do 

where D A := {(s, t) S D: 3i G {0,..., n — 1} with s < Si and t>Si}. 

Proof. It is straightforward to verify that /r A is an information drift 
for (Q A ). Moreover, 

\E f T {fi A ) 2 d(M,M) r 
do 

Tt 1 pS 

= lE E I (j*?) 2 d{M,M) r 

i=0 Jsi 

Tl —1 / r p 1 r Jf' \ 

= lE(W (hr') 2 d(M,M) r -E [ (/i P) 2 d(M,M) r ) 

i= Q ^ 1 / 

n— 1 

= 7r([°, x (si, Si+i]) = ? r (^ A )- 

i =0 U 

We can now state the main theorem of this section. 

Theorem 4.4. Let A n , n G N, be a sequence of partitions of the inter¬ 
val [0,T], the mesh of which tends to 0. Ifir(D) is finite, then the information 
drifts fi An converge in L 2 (M) to a (Qt)-information drift //. Moreover, the 
utility gain of the insider satisfies 

u(Qt,x) — u(Tt,x) = \E I p 2 d(M, M) = tt(D). 

do 

If n(D) is infinite, then so is the utility gain of the insider. 

The proof of the theorem is based on the following proposition. 
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Proposition 4.5. If tt(D) < oo ; then there exists a (Qt)-information 
drift /r. Moreover, 

IIImII ! 2 (m) = 7r (^ ) )- 

Proof. Let A n , n £ N, be as in the above theorem with the addi¬ 
tional assumption that A n+ i is a refinement of A n for all n £ N. Then 
one has Q An C Q An+1 for any t £ [0, T]. By Proposition 4.3, ^ 1111 (jvy) = 
jr(I)A") < 7 t(D). Due to Theorem 3.4, the information drifts /i An converge 
to a (VnGN^i An ) = (&-^information drift /j, in L 2 (M). Using monotone 
convergence, we obtain that 

*( D ) = AD An ) = n Iim 111 n LA " 11 £2 (M) = 1 1 \n \|| 2(M) . 

Since every cadlag (f/ t _)-martingale is as well a (<5 t )-martingale, is a (f/t)- 
information drift. □ 

Proof of Theorem 4.4. Assume that tt(D) is finite. Since the mesh of 
the partitions A n tends to zero, one has limj^oo t D A n (x ) = 1 for all x £ D. 
Consequently, the dominated convergence theorem yields 

(11) lim Tr(D An ) = tt(D). 

n—>oo 

We established the existence of a ((^-information drift /r in Proposition 4.5. 
Recall that, by Lemma 1.6, the processes /r A " and ^ — /i An are orthogonal 
in L 2 (M). Consequently, 

II h IIl 2 (M) = IHli 2 (M) ~ ll/ iAn |li 2 (M)' 

Due to (11), the right-hand side of the previous equation converges to 0. 
Hence, /r An converges to ^ in L 2 (M). The remaining statements are conse¬ 
quences of Propositions 4.5 and 1.4. □ 

5. Additional utility and entropy of filtrations. In this section we con¬ 
sider the link between the additional expected logarithmic utility of a better 
informed agent and the entropy of the additional information he possesses. 
The additional utility was first expressed in terms of a relative entropy in 
[23], page 1103, for a particular example. More generally, [3] discussed the 
link between the absolute entropy of a random variable describing initially 
available additional information and the utility increment of better informed 
agents. Here we shall see that the expected logarithmic utility increment is 
given by an integral version of relative entropies of the a-algebras of the 
filtration. This notion can best be understood as the limit of discrete en¬ 
tropy sums along a sequence of partitions of the trading interval as the mesh 
goes to 0. Alternatively, we are able to give an interpretation of the utility 
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increment by Shannon information differences between the filtrations of the 
agents. In particular, we shall see that these differences are independent of 
any local martingales the filtrations may carry. 

Suppose that the assumptions of Section 2 are satisfied. Moreover, we 
assume that M is a continuous local martingale satisfying the (PRP) relative 
to (. J-t), which simply means that L A = 0. Equation (7) simplifies to 

P t (;A)=P(A) + [ t k s (;A)dM s , 

Jo 

where k(-,A ) is as in Section 2. Let again (Gt) be a filtration satisfying 
JP. 0 C Gt and being generated by countably many sets. To simplify nota¬ 
tion, we assume the filtration (Gt) to be left-continuous. Let (Gt) be the 
smallest completed and right-continuous filtration containing (Gt)- I n the 
following, we assume that (Gt) is a finite utility filtration and denote by [i 
its predictable information drift, that is, 

M = M — [ fi t d(M, M) t 

Jo 

is a (t/j)-local martingale. Recall that, by Theorem 2.10, we may assume 
that kt(u>,-) is a signed measure. For a fixed r > 0, we define pL as the 
information drift of the initially enlarged filtration (G[), defined as in the 
beginning of the preceding chapter. For stating the main result we need the 
following lemma. 

Lemma 5.1. Let 0 < s <t and (V m ) m >o be an increasing sequence of 
finite partitions such that (j(V m : m > 0) = G°- Then 

lim E f J2 fir) (;A)l A d(M,M) u = E f (p s u ) 2 d(M, M) u 

m Js J S 

and 

lim E f J2 ^(;A)t A pL s u d(M,M) u = E f (pt s u ? d(M,M) u . 
m Ae_v m 

Proof. By Lemma 2.10, the process 

Y™(u,J)= Y, jrfaA)t A (J), m> 1, 

is a L 2 -bounded martingale for Pm- a.a. (oj,u) S 11 x [s,t\- Hence, (Y m ) con¬ 
verges Pm- a.s. to the density 

_ k u (-,du') 
lu ~ P u (;duj f ) 
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By Theorem 2.6, we have 


7„(w,u/) = /4M 

Pm~&- s. on x [s,t] and, hence, the first result. In a similar way, one can 

prove the second statement. □ 

We next discuss the important concept of the additional information of a 
c-field relative to a filtration. 


Definition 5.2. Let A be a sub-c-algebra of T and R, Q two proba¬ 
bility measures on T. Then we define the relative entropy of R with respect 
to Q on the cx-held A by 


H A (R\\Q) 



if 

else. 


Moreover, the additional information of A relative to the filtration (F r ) on 
[s,t\ (0 < s < t < T) is defined by 

H A (s, t) = j H A {P t {w, -)ll^(w, •)) dP{u). 


The following lemma establishes the basic link between the entropy of a 
filtration enlargement and additional logarithmic utility of a trader possess¬ 
ing this information advantage. 


Lemma 5.3. For 0 < s <t, we have 

rt 


H Q o{s,t) = \E j [) 2 d(M,M ) u . 


Proof. Let (' P m ) m >o be an increasing sequence of finite partitions such 
that a(V m :m> 0) = Q Recall that, by (9) 


E 

Aev™ 


1a log P S {-,A) - 1a log P t (-,A) 
P k 


= E 

Ag-pm L 


k, 


P„ 


(;A)t A dM u - -f (-,yl)lA/ind(M,M), 


+ 


p, 


1 Pfk 


is \Pu 


{;A)t A d{M,M\ 
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Since M is a local martingale, we obtain by stopping and taking limits if 
necessary 



Note that in the previous line fi may be replaced by /P, because (/i — /i s ) 
is orthogonal to L' 2 (M)(G S ) [see property (iv) in Lemma 1.6]. Applying 
Lemma 5.1 yields 



Fatou’s lemma implies 

lim inf H-p m ( S,t ) > Hgo(s,t). 

m s 

On the other hand, we have iLp™(s, t ) < Hgo(s,t), since V m C G® and, thus, 

lim H-pm (s. t)=Hgo( s ,t), 
which completes the proof. □ 

Let us now return to the stepwise approximation of a filtration enlarge¬ 
ment along a sequence of partitions of the trading interval by “initial en¬ 
largements,” and define their respective information increment. 

Definition 5.4. Let A: 0 = so < ■ • ■ < s n = T, n E N, be a partition of 
the interval [0,T] and let /r A be the information drift of (G^)- The additional 
information of (G^) relative to (JF r ) is defined as 


n— 1 


Ha — V s *+i)- 

i =0 


Theorem 5.5. We have 



Proof. This follows from Theorem 4.4 and Lemma 5.3. □ 

Example 5.6. Let Gt = Ft V<t(V), where V is a finite partition in J~t- 
Then = /i and by Lemma 5.3, 
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If T 0 is trivial, then 

H g o(0,T) = -J2 P(A)logP(A), 

Aev 

which is the absolute entropy of the partition V. Thus, the additional loga¬ 
rithmic utility of an agent with information (Qt) is equal to the entropy of 
V. This example shows that there is a link between logarithmic utility and 
the so-called Shannon information. 

Definition 5.7. Let X and Y be two random variables in some mea¬ 
surable spaces. The mutual information (or Shannon information) between 
X and Y is defined by 


I{X,Y)=H(P x ,y\\Px®Py). 

Now let Z be a third random variable. The conditional mutual information 
of X and Y given Z is defined by 

I(X : Y\Z) = E[H(P XX \z\\P x \ z ®P y \z)}, 

provided the regular conditional probabilities exist. 

If A is a sub-cr-algebra of T, then we write id .4 for the measurable map 
(D, T) —> (£1,A),uj i — > lo. For two sub-er-algebras A and V, we abbreviate 

I(A,V) = I( kU,idp). 

Since our probability space is standard, for any sub-cr-helds A,T>,£ of T, 
there exists a regular conditional probability Tld^idMidg and we define 

I(A,V\£) : = I (id_ 4 , idx>|idg). 

The mutual information was introduced by Shannon as a measure of in¬ 
formation. It plays an important role in information theory (see, e.g., [16]). 


Theorem 5.8. 


lim ^ Ttr ° 


| A|—>0 




uei,K»\ ^)=\e f* 




Proof. Note that, for three random variables X, Y and Z, we have 

dP(x,Y)\z _ dPx\(Y,z) 
d{Px\z®Py\z) dP x \z 
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This property implies that one has, for 0 < s < t < T, 


I(G° a ,F? \J*)= / /log 


(IP: 


id r o |id 


log 


^dgolid^o 

P t (-,duj') 


dP(J) dP(u) 
dP(J) dP(uj) 


Ps(-,dw') \go 
= H go(s,t). 

Thus, the assertion is an immediate consequence of Theorem 5.5. □ 


This result motivates the following notion. 

Definition 5.9. The information difference of ( Q J?) relative to (Ff) up 
to time T is defined as 

A(S»,^) = lim 

i 

Remark 5.10. Note that we did not use M in our definition of the 
information difference of (Q®) relative to (F^). However, by Theorem 5.8, the 
information difference may be represented in terms of any local martingale 
satisfying the (PRP). 

Theorem 5.8 can be reformulated in the following way. 

Theorem 5.11. The additional utility of an agent with information (Qt) 
is equal to the information difference of (Q®) relative to (F ( f), that is, 

ug(x) - Ujr(x) = A(Q°,F q ). 

If (Qt) is initially enlarged by some random variable G , then the informa¬ 
tion difference of (Qf) relative to (Fff) coincides with the Shannon informa¬ 
tion between G and (Fji). 

Lemma 5.12. Let Qt = Ft V (j(G), where G is a random variable with 
values in some Polish space. Then 

A(Q°,F°) = I(G,F% \J%). 

Proof. Let 0 < s < t. By standard arguments, we have I(Q ®, F®\F^) = 
I(G,J*\J*) and 

I(G,F?\F%) = I(G,(F?,F Q S )\F° 0 ) 

= I(G,J*\F?) + I(G,F?\f8) 
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(see, e.g., [16], Theorem 1.6.3). By iteration, we obtain, for all partitions A, 

i 

and, hence, the result. □ 

Theorem 5.13. Let Q f ° = P t ° V cr(G), where G is a random variable 
with values in some Polish space. Then the additional logarithmic utility of 
an agent with information (Qt) is equal to the Shannon information between 
G and {Pjf conditioned on Tq, that is, 

ug(x) -ujr( x ) =I(J%,G\J$). 

In particular, if J-q is trivial, then the additional utility is equal to I(J-j ., G). 
Proof. This follows from Lemma 5.12 and Theorem 5.8. □ 


Remark 5.14. If Qf = V o{G) and G is ./^-measurable, then the 
mutual information I(P%, G\Pn) is equal to the conditional absolute entropy 
of G (see also [3]). 


Example 5.15. Let P) be the one-dimensional canonical Wiener 

space equipped with the Wiener process (Wt)o<t<i- More precisely, 
R = C([0,1],M) is the set of continuous functions on [0,1] starting in 0, T the 
u-algebra of Borel sets with respect to uniform convergence, P the Wiener 
measure and W the coordinate process. (Pt)o<t<i is obtained by completing 
the natural filtration (J~t)o<t<i • Suppose the price process S is of the form 

St = exp(Wt + bt), 0<f<l, 

with b G R. We want to calculate the additional utility of an insider knowing 
whether the price exceeds a certain level or not. More precisely, we suppose 
the insider to know the value of 

G=i (C)00) (sr), 

where c > 0 and = maxo<£<i St- By Remark 5.14, the additional utility 
is equal to the entropy 

H(G) =plogp+ (1 —p) log(l -p), 

where 


p = P(St>c). 

This may be calculated via Girsanov’s theorem. Namely, we have 


P(Sl >c ) 


P ('V t G [0,1]: max Wt + bt> log c ) 

V *£[0,1] / 

log c\ 


exp( blogc — —s 


10 


\/2tts 3 


exp 


hsA)* 

2s J 
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6. Mutual information estimates. In this final section we apply some 
results from information theory to derive estimates for the information of 
a better informed agent. This yields a priori estimates for the agent’s ad¬ 
ditional expected logarithmic utility in the light of the preceding section. 
Among other facts, the differential entropy maximizing property of Gaus¬ 
sian laws will play a role. We adopt the notation of [16]. 

Before we provide the information estimates, we summarize some basic 
facts of the mutual information (see [16], Theorem 1.6.3). For random vari¬ 
ables X, Y, Z in some Borel spaces, the following properties hold: 

(1.1) I(X,Y\Z) > 0 and, I(X,Y\Z ) = 0 if and only if X and Y are indepen¬ 
dent given Z. 

(1.2) I(X, (Y, Z)) = I(X, Z) + I(X,Y\Z). 

(1.3) If A is a continuous random variable with finite differential entropy, 
then 


I(X,Y) = h(X) - h(X\Y). 

For some fixed integer d £ N, let X be a dj.-measurable R^-valued random 
variable. Moreover, denote by Y a d-dimensional r.v. that is independent of 
the ex-field Tj.. We consider the enlarged filtration Qf = J-f V &(G), where 
G:=X + Y. 

Lemma 6.1. Suppose that the law of Y is absolutely continuous with 
respect to Lebesgue measure and has finite differential entropy 

hm= ~J 

Then 

(12) I{G,J%) = h{X + Y)-h{Y). 

Proof. Due to property (1.2), we have 

I(G,J%) = I(X + Y, X) + I(X + Y,J%\X). 

Given X, the r.v.’s X + Y and id^o are independent. Therefore, (1.1) and 

(1.3) lead to 

I(G, Ft) = h x + Y,X) = H x + Y )~ K x + Y\X) = h(X + Y) — h(Y) .□ 

Now assume the perturbation Y to be a Revalued centered Gaussian r.v. 
that is independent of 
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Lemma 6.2. Suppose that X e L 2 (P) and let Cx and Cy denote the 
covariance matrices of X and Y, respectively. Then 


(13) 


I{G,J%)<\ log 


det(Cx + C Y ) 
det (Cy) 


Moreover, equality holds in (13) if X is Gaussian. 


Proof. The distribution of Y is continuous with respect to Lebesgue 
measure and has finite entropy. Therefore, 

I(G,J%) = h(X + Y)-h(Y). 

Let Cx and Cy denote the covariance matrices of X and Y, respectively. 
Due to the independence of X and Y, the random variable X + Y has the 
covariance matrix Cx+y = Cx + Cy. Next recall that the normal distribu¬ 
tion maximizes the differential entropy under a covariance constraint, that 
is, h(X + Y) < h(Z), where Z is a centered Gaussian r.v. with covariance 
matrix Cx+y ■ Therefore, 

I{X,X + Y) < h(Z) — h(Y). 

Using the formula for the differential entropy of Gaussian measures (Theo¬ 
rem 1.8.1, [16]), we obtain 

h(Z) - h{Y) = i log((2-7re) rf det(Cx+r)) - ^ log((27re) d det(Cy)) 

_ 1 det(Cy + y) 

“ 2 ° g det (Cy) ' 

If X is Gaussian, then h(X + Y) = h(Z ) and, hence, the second statement 
of the lemma follows. □ 


Corollary 6.3. Assume that additionally to the assumptions of the 
above lemma, the equation Y = kN is valid, where N is a d-dimensional 
standard normal r.v. and k > 0. Then 

g—, 

A , K 

3=1 

where A j (j = 1,..., n) denote the eigenvalues of Cx- 

Proof. The proof follows easily by computing the determinants in Lem¬ 
ma 6.2. □ 


The proof of Lemma 6.2 is based on the fact that Gaussian distributions 
maximize the differential entropy under a constraint on the covariance struc¬ 
ture. Let us recall the construction of entropy maximizing measures under 
a linear constraint. 





34 S. ANKIRCHNER, S. DEREICH AND P. IMKELLER 

Lemma 6.4. Let E c M. d be a measurable set, c > 0 and g:E —> [0,oo) 
a measurable map. Assume that there exist constants Z,t> 0, such that the 
measure v defined by 

= L e -ta(x) 
d\ d[J Z 

is a probability measure satisfying E u [g] = c. Then v is the unique probability 
measure maximizing the differential entropy among all continuous probability 
measures p on E satisfying E^[g\ = c. 

The entropy maximization problem is equivalent to minimizing the rela¬ 
tive entropy Lt(-\\X d ). Hence, the problem can be treated under more general 
constraints by using results of [9], Theorem 3.1. 

Proof. Let gbe a continuous probability measure on E with E^[g\ = c. 
Then 

= E^log f = E" log ft + E> logffi 
= —h(n) + log Z + tE^g = —h{p) + log Z + tE v g 

= -Kv) - E ” lo g -g- = -h(lJ-) + h{u). 

Since Tt(n ||p) > 0 and H{fi\\v) = 0 iff p, = v, v is the unique maximizer of 
the differential entropy. □ 

Remark 6.5. The above lemma can be used to derive similar results 
as obtained in Lemma 6.2. For instance, for E := R and g[x) := \x\, one ob¬ 
tains that the two-sided exponential distribution maximizes the differential 
entropy under the constraint E^g = c (c > 0). In particular, the measure v 
with = (2c) -1 e“l x l/ c satisfies 

E u [g\=c and h{v) = 1 + log(2c). 

Now let X be a real-valued r.v. in L 1 (P). Moreover, let k\ := E[\X — EX |] 
and Y be a two-sided exponential distribution with E\Y\ =: K 2 - Then due 
to Lemma 6.1, 

7( G ,^)<log(^). 

Example 6.6. We consider the classical stock market model with one 
asset. Let {J~t)t£\o,T\ be a Brownian filtration generated by the Brownian 
motion (.Bt)te[o,T] and denote by (E t ) its completion. The stock price is 
modeled by the process 


S t = S 0 ex.p{B t + bt}, 
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where So > 0 is the deterministic stock price at time 0 and b £ R. For some 
fixed times ti,... ,td £ (0,T] (d £ N), let X := (B ti )i=i t ... } d- We suppose that 
the insider bases his investment on the filtration Qf = f] s>t X s V cr(G), where 
G = X + kN and N is a standard normal r.v. in W l that is independent of 
Tt- Due to Lemma 6.2, the additional utility of the insider is related to the 
eigenvalues of the matrix 


( tl 

ti . 

• h\ 

tl 

ti ■ 

■ t-2 

\h 

. . CM 

-+o 

■ t d J 


Let us finish the section with an example for a general enlargement. 


Example 6.7. We reconsider the classical stock market model of Ex¬ 
ample 6.6 with T := 1. The knowledge of the insider at time t is modeled by 
Qt = C)r>tFr v cr({G s ) se [ 0 ,r]), where G t := Bi + (B t ) is a Brownian 

motion independent of (Bt) and g: [0,1] —► [0,oo) is a decreasing function. 
We are therefore in a setting similar to Example 2.13. We now calculate the 
utility increment from the perspective of the notion of information difference 
of filtrations. Let n be as in Section 4. For 0 < s < t < 1, we have 

7r ([0, s) x ( S ,t]) = I((G u ) ue[ o )S ],^ 0 |^ ) ) 

= I(G Sl X?\^)=I(G s ,B t \T°) + I(G s ,X?\T°,B t ) 

= I(Bi + B g ^_ s ),B t - B s | jJ) 

= I(B\ — B S + Bg(l_g-),Bt — B s ). 

Using the formula for the differential entropy for Gaussian measures, we 
obtain 


7r([0, s) x (s, t]) = h(Bi - B s + B g ^_ s )) - h(Bi -B t + B g ( i_ s) ) 


= 2 !og(27re(l - s + g( 1 - s))) 

- ^log(2vre(l -t + g( 1 - s))) 

1, l-s + g(l-s) 

2 ° S l-t + g(l-s)' 

Alternatively, one can express 7r([0, s) x (s,t]) as 

rt i 


vr([0,s) x M ) = \l Y 


-u + g(l- s) 


du. 
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For a partition A:0 = to < • • • < t m = 1 (m G N), we consider D A as in 
Section 4. One has 

n 

n(D A ) = ^x 1 ]) 

= - f 1 ----- T -— du. 

2 Jo 1 — u + g( 1 — maxjtj: ij < u \) 

Next, choose a sequence of refining partitions (A n ) such that their mesh 
tends to 0. Then the term in the latter integral is monotonically increasing 
in n and convergent. Hence, one obtains 

lim n(D An ) = - / - - -- du. 

n^oo 2 Jo l-u + g(l — u) 

On the other hand, 

lim 7T (D An ) = tt(D) = ug(x) — u^ix). 

n —kx) 

Consequently, the insider has finite utility if and only if /g 1 du < oo 

is finite. Now suppose g(y ) = Cy p for some C > 0 and p > 0. It is straightfor¬ 
ward to show that the integral and, hence, the additional utility, is finite if 
and only if p G (0,1). This equivalence follows also from results in [8], where 
the authors compute explicitly the information drift. 
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